SUMMARY

Certain experiments contemplated for space platforms must be isolated from the accelerations
of the platform. In this paper an optimal active control is developed for microgravity
vibration isolation, using constant state feedback gains (identical to those obtained from the
Linear Quadratic Regulator [LQR] approach) along with constant feedforward (preview) gains.

The quadratic cost function for this control algorithm effectively weights external
accelerations of the platform disturbances by a factor proportional to $(1/\omega)^4$. Low frequency
accelerations (less than 50 Hz) are attenuated by greater than two orders of magnitude. The
control relies on the absolute position and velocity feedback of the experiment and the absolute
position and velocity feedforward of the platform, and generally derives the stability robustness
characteristics guaranteed by the LQR approach to optimality.

The method as derived is extendable to the case in which only the relative positions and 
velocities and the absolute accelerations of the experiment and space platform are available. 


1. INTRODUCTION 

A space platform experiences local, low frequency accelerations (0.01 to 30 Hz) due to
equipment motions and vibrations, and to crew activity (ref. 1). Certain experiments, such as
the growth of isotropic crystals, require an environment in which the accelerations are less than
a few micro-g's (ref. 2). Such an environment is not presently available on manned space
platforms.

Since the experiment and space platform centers of gravity generally do not coincide, a
means is needed to prevent a free-floating experiment from drifting into its own orbital motion
and into the space platform wall. Additionally, most experiments will require umbilicals of some
sort to provide power, experiment control, coolant flow, communications linkage, and/or other
services. Unfortunately, such measures also mean that unwanted platform accelerations will be
transmitted to the experiments. This necessitates experiment isolation. Passive isolators,
however, cannot compensate for umbilical stiffness, nor can they achieve low enough corner
frequencies even if umbilicals are absent. Therefore, it is essential to compensate this
environment with active isolation means.

The problem then becomes to design an active isolation system to minimize this
undesired acceleration transmission, while achieving adequate stability margins and system
robustness for the controller design. In addition, spatial and control energy limitations must
also be accommodated. This paper will explore the optimal control problem of microgravity
experiment isolation from the low frequency range of disturbances experienced on the Shuttle
and in the future Space Station Freedom Microgravity Modules.


2. MATHEMATICAL MODEL 

The general problem has three translational and three rotational rigid body degrees of 
freedom. For simplicity, however, this analysis will consider only the one-dimensional problem. 
The general problem could be treated in an analogous manner. Let the experiment be modeled 
as a mass m, with position x(t). Assume that the space platform has position d(t), and that 
umbilicals with stiffness k and damping c connect the experiment and space platform. Suppose 
further that a magnetic actuator applies a control force proportional to the applied current i(t), 
with proportionality constant a. Such a model is shown in figure 1. 

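The system equation of motion is

$$m\ddot{x} + c(\dot{x} - \dot{d}) + k(x - d) + ai = 0 \tag{1}$$

Division by m and rearrangement yields

$$\ddot{x} = -\frac{k}{m}(x - d) - \frac{c}{m}(\dot{x} - \dot{d}) - \frac{a}{m}i \tag{2}$$

In state space notation this becomes

$$\dot{\mathbf{x}} = A\mathbf{x} + bu + f \tag{3}$$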


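where, reading the terms directly from equation (2),

$$\mathbf{x} = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} = \begin{bmatrix} x \\ \dot{x} \end{bmatrix}, \quad A = \begin{bmatrix} 0 & 1 \\ -k/m & -c/m \end{bmatrix}, \quad b = \begin{bmatrix} 0 \\ -a/m \end{bmatrix}, \quad u = i, \quad f = \begin{bmatrix} 0 \\ f_2 \end{bmatrix} = \begin{bmatrix} 0 \\ \dfrac{k}{m}d + \dfrac{c}{m}\dot{d} \end{bmatrix}$$

The objective is to minimize the acceleration $\ddot{x}(t)$.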
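As a minimal sketch (not part of the original analysis), the one-dimensional model of equation (2) can be written in first-order form with NumPy. The state ordering [x, x_dot], the function names `plant_matrices`, `disturbance`, and `state_derivative`, and the lbm-to-slug conversion factor of 32.2 are assumptions made for illustration; the parameter values are the ones the paper assigns in its control evaluation.

```python
import numpy as np

G_C = 32.2  # assumed lbm-per-slug conversion factor (not stated in the text)

def plant_matrices(m_lbm, k, c, a):
    """Return (m, A, b) for the state vector [x, x_dot] and control u = i."""
    m = m_lbm / G_C                      # mass in slugs
    A = np.array([[0.0, 1.0],
                  [-k / m, -c / m]])
    b = np.array([0.0, -a / m])
    return m, A, b

def disturbance(m, k, c, d, d_dot):
    """Forcing vector f produced by platform motion d(t) through the umbilical."""
    return np.array([0.0, (k * d + c * d_dot) / m])

def state_derivative(A, b, x, u, f):
    """Right-hand side of x_dot = A x + b u + f."""
    return A @ x + b * u + f

m, A, b = plant_matrices(m_lbm=100.0, k=0.3, c=0.0, a=10.0)
f = disturbance(m, k=0.3, c=0.0, d=0.001, d_dot=0.0)
x_dot = state_derivative(A, b, x=np.array([0.0, 0.0]), u=0.0, f=f)
print(x_dot)  # second entry is the uncontrolled experiment acceleration, ft/s^2
```

With the experiment at rest and no control current, the only acceleration is the umbilical force (k/m)d, which is the quantity the controller is meant to suppress.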


3. OPTIMAL CONTROL PROBLEM 

The optimal control problem is that of determining the control current u(t) = i(t) which
minimizes a suitable performance index

$$J = J(\mathbf{x}, u, t) \tag{4}$$

for the system described by equation (3) subject to the state variable conditions

$$\mathbf{x}(0) = \mathbf{x}_0 \tag{5a}$$

$$\lim_{t \to \infty} \mathbf{x}(t) = 0 \tag{5b}$$

Another reasonable assumption is that f(t) is bounded, and it will be found mathematically
advantageous (and only minimally restrictive) to assume that f(t) is also a dwindling function:

$$\lim_{t \to \infty} f(t) = 0 \tag{5c}$$


A quadratic performance index

$$J = \frac{1}{2}\int_0^\infty \left[\mathbf{x}^T W_1 \mathbf{x} + w_3 u^2\right] dt \tag{6}$$

has been chosen, as one that lends itself well to the variational approach to optimal controls,
since an analytical solution is desired. The upper limit of the definite integral has been selected
so as to yield a time-invariant controller. Here $W_1$ is a square 2 by 2 constant weighting matrix,
while $w_3$ is a scalar weighting constant.

Although $W_1$ could be a full 2 by 2 matrix, for this problem a diagonal form has been
employed for the sake of simplicity.


$$W_1 = \begin{bmatrix} w_{1a} & 0 \\ 0 & w_{1b} \end{bmatrix} \tag{7}$$

The performance index consequently reduces to

$$J = \frac{1}{2}\int_0^\infty \left[w_{1a}x^2 + w_{1b}\dot{x}^2 + w_3 u^2\right] dt,$$

so that each state is weighted independently.

If sinusoidal motion of the experiment is considered, so that

$$x(t) = B\sin\omega t \tag{8}$$

and $\ddot{x}(t) = -\omega^2 x(t)$, the cost function can be expressed in terms of the acceleration and control
as

$$J = \frac{1}{2}\int_0^\infty \left[\left(\frac{w_{1a}}{\omega^4} + \frac{w_{1b}}{\omega^2}\right)\ddot{x}^2 + w_3 u^2\right] dt \tag{9}$$

It is apparent that this performance index conveniently weights accelerations at low frequencies
much more than at higher frequencies.
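The frequency weighting described above can be illustrated with a few lines of Python. This is only a sketch: the function name `acceleration_weight` is hypothetical, and the weights 258 and 10 are the values the paper quotes for its design case.

```python
import math

def acceleration_weight(freq_hz, w1a, w1b):
    """Effective weight on acceleration squared in equation (9) at one frequency."""
    omega = 2.0 * math.pi * freq_hz      # rad/s
    return w1a / omega**4 + w1b / omega**2

for freq in (0.01, 0.1, 1.0, 10.0):
    weight = acceleration_weight(freq, w1a=258.0, w1b=10.0)
    print(f"{freq:6.2f} Hz  weight on acceleration^2 = {weight:.3e}")
```

The position weight contributes the 1/omega^4 term and the velocity weight the 1/omega^2 term, so the penalty on acceleration grows steeply as the frequency falls.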


4. SOLUTION 

Finding the optimal control to minimize equation (4) is a variational problem of Lagrange,
for which the initial steps of the solution are well-known (e.g., Elbert (ref. 3)). The
variational approach is outlined below, following which the complications added by the
nonhomogeneous term f(t) will be addressed. Current optimal controls texts either assume that
f(t) = 0 (e.g., (ref. 3), p. 262) or require that it have a restricted range space (e.g., (ref. 6),
p. 238). The solution that follows provides an analytical optimal control without imposing such
restrictions.
The argument of the cost function J from equation (4) is augmented by the Lagrange
multiplier $\lambda^T$ times the system equation of motion, equation (3), where

$$\lambda = \begin{bmatrix} \lambda_1 \\ \lambda_2 \end{bmatrix}$$

The result J can be expressed as

$$J = \int_0^\infty H\,dt$$

where the Hamiltonian H is

$$H = \frac{1}{2}\left(\mathbf{x}^T W_1 \mathbf{x} + w_3 u^2\right) + \lambda^T\left(\dot{\mathbf{x}} - A\mathbf{x} - bu - f\right)$$

It is desired to obtain an optimal solution $u = u^*$ which minimizes J.
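The first variation of $J(\mathbf{x}, u, \dot{\mathbf{x}})$ is

$$\delta J = \int_0^\infty \left[\frac{\partial H}{\partial \mathbf{x}}\,\delta\mathbf{x} + \frac{\partial H}{\partial u}\,\delta u + \frac{\partial H}{\partial \dot{\mathbf{x}}}\,\delta\dot{\mathbf{x}}\right] dt$$

which is set equal to zero to minimize J. However, integrating by parts (the boundary term
vanishes because the state is fixed at both limits by equations (5a) and (5b)),

$$\int_0^\infty \frac{\partial H}{\partial \dot{\mathbf{x}}}\,\delta\dot{\mathbf{x}}\,dt = -\int_0^\infty \dot{\lambda}^T\,\delta\mathbf{x}\,dt$$

so that the above expression for $\delta J$ becomes

$$\delta J = \int_0^\infty \left[\left(\frac{\partial H}{\partial \mathbf{x}} - \dot{\lambda}^T\right)\delta\mathbf{x} + \frac{\partial H}{\partial u}\,\delta u\right] dt = 0$$

Both $\delta\mathbf{x}$ and $\delta u$ are arbitrary variations, so $\delta J = 0$ only if

$$\frac{\partial H}{\partial \mathbf{x}} - \dot{\lambda}^T = 0 \tag{14a}$$

$$\frac{\partial H}{\partial u} = 0 \tag{14b}$$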

The conditions given by equation (5) still apply.
Solving eqs. (14a) and (14b) yields

$$\dot{\lambda} = W_1\mathbf{x} - A^T\lambda \tag{15a}$$

$$u^* = \frac{1}{w_3}b^T\lambda \tag{15b}$$

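Temporarily eliminating $u^*$ produces the result

$$\begin{bmatrix}\dot{\mathbf{x}} \\ \dot{\lambda}\end{bmatrix} = \tilde{A}\begin{bmatrix}\mathbf{x} \\ \lambda\end{bmatrix} + \begin{bmatrix} f \\ 0\end{bmatrix} \tag{16}$$

where

$$\tilde{A} = \begin{bmatrix} A & \dfrac{1}{w_3}bb^T \\ W_1 & -A^T \end{bmatrix}$$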


If equation (16) is now solved for $\lambda$ in terms of x and of f, equation (15b) will then furnish
an expression for the optimal control $u^*$.

As noted before, optimal control texts generally treat the homogeneous problem (where
f(t) = 0), but they do not provide an analytical solution to the nonhomogeneous system
described by equations (5) and (16). Salukvadze has treated the nonhomogeneous problem (refs. 4
and 5), but this difficult treatment seems largely to have remained either uncomprehended or
under-appreciated. This method is especially well suited to low-frequency disturbance rejection,
and has been applied below to the present problem.
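The homogeneous solution to equation (16), where f = 0, is

$$\begin{bmatrix}\mathbf{x}\\ \lambda\end{bmatrix} = \sum_{k=1}^{4} c_k e^{\mu_k t} E_k \tag{17}$$

in which $\mu_k$ and $E_k$ are the eigenvalues and eigenvectors of the 4 by 4 coefficient matrix of
equation (16). The four eigenvalues may be found to be, in ascending order of real parts,

$$\mu_1 = -\left[-\beta_1 + \left(\beta_1^2 - \beta_2\right)^{1/2}\right]^{1/2} \tag{18a}$$

$$\mu_2 = -\left[-\beta_1 - \left(\beta_1^2 - \beta_2\right)^{1/2}\right]^{1/2} \tag{18b}$$

$$\mu_3 = -\mu_2 \tag{18c}$$

$$\mu_4 = -\mu_1 \tag{18d}$$

where $\beta_1$ and $\beta_2$ are defined as follows:

$$\beta_1 = \frac{1}{2}\left(\frac{2k}{m} - \frac{c^2}{m^2} - \frac{a^2 w_{1b}}{m^2 w_3}\right) \tag{19a}$$

$$\beta_2 = \frac{a^2 w_{1a}}{m^2 w_3} + \frac{k^2}{m^2} \tag{19b}$$

The eigenvectors corresponding to the respective eigenvalues $\mu_k$ may be chosen to be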
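$$E_k = \begin{bmatrix} 1 \\ \mu_k \\ \dfrac{\gamma_4}{\mu_k} + \dfrac{\gamma_1^2}{\mu_k\gamma_3} + \dfrac{\gamma_1(\gamma_2 + \mu_k)}{\gamma_3} \\ \dfrac{\gamma_1 + (\gamma_2 + \mu_k)\mu_k}{\gamma_3}\end{bmatrix} \tag{20a}$$

where $\gamma_1$, $\gamma_2$, $\gamma_3$, and $\gamma_4$ are defined below:

$$\gamma_1 = \frac{k}{m} \tag{20b}$$

$$\gamma_2 = \frac{c}{m} \tag{20c}$$

$$\gamma_3 = \frac{a^2}{m^2 w_3} \tag{20d}$$

$$\gamma_4 = w_{1a} \tag{20e}$$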

Using equations (18) through (20) with (17) the solution to the homogeneous system is

$$\begin{bmatrix} x_1 \\ x_2 \\ \lambda_1 \\ \lambda_2\end{bmatrix} = c_1 e^{\mu_1 t}E_1 + c_2 e^{\mu_2 t}E_2 + c_3 e^{-\mu_2 t}E_3 + c_4 e^{-\mu_1 t}E_4 \tag{21}$$

with $E_k$, k = 1, . . ., 4, the eigenvectors given by equation (20a), and where $c_1$, . . ., $c_4$ are
arbitrary constants.

Application of the variation of parameters method with terminal conditions (eqs. (5b)
and (c)) leads to the general solution of the nonhomogeneous system, with two constants of
integration yet undetermined.
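The eigenvalue structure of the coupled state and costate system can be checked numerically. The sketch below builds the 4 by 4 matrix of equation (16) for the paper's design values and compares its eigenvalues with the closed-form roots of the characteristic polynomial s^4 + 2*beta1*s^2 + beta2 = 0. The block layout of the matrix, the expressions used for beta1 and beta2, and the conversion factor 32.2 are assumptions reconstructed from equations (2), (15), and (16), not a transcription of the original.

```python
import numpy as np

# Design values quoted in the paper; 32.2 is an assumed lbm-to-slug conversion.
m, k, c, a = 100.0 / 32.2, 0.3, 0.0, 10.0
w1a, w1b, w3 = 258.0, 10.0, 1.0

A = np.array([[0.0, 1.0], [-k / m, -c / m]])
b = np.array([[0.0], [-a / m]])
W1 = np.diag([w1a, w1b])

# Coupled state/costate matrix: x_dot = A x + (1/w3) b b^T lam, lam_dot = W1 x - A^T lam
A_aug = np.block([[A, b @ b.T / w3],
                  [W1, -A.T]])
mu_numeric = np.sort_complex(np.linalg.eigvals(A_aug))

# Closed-form roots of s^4 + 2*beta1*s^2 + beta2 = 0
beta1 = 0.5 * (2.0 * k / m - (c / m) ** 2 - a**2 * w1b / (m**2 * w3))
beta2 = (k / m) ** 2 + a**2 * w1a / (m**2 * w3)
root = np.sqrt(complex(beta1**2 - beta2))
mu1 = -np.sqrt(-beta1 + root)
mu2 = -np.sqrt(-beta1 - root)
mu_closed = np.sort_complex(np.array([mu1, mu2, -mu2, -mu1]))

print(mu_numeric)
print(mu_closed)
```

The eigenvalues occur in plus and minus pairs, as expected for a Hamiltonian system; the pair with negative real parts sets the closed-loop dynamics.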
If the two constants of integration are eliminated by solving for $\lambda$ in terms of x and f,
$\lambda_1$ and $\lambda_2$ become:

$$\lambda_1 = \xi_1 x_1 + \xi_2 x_2 + \xi_3 e^{-\mu_1 t} + \xi_4 e^{-\mu_2 t} \tag{22a}$$

$$\lambda_2 = \xi_5 x_1 + \xi_6 x_2 + \xi_7 e^{-\mu_1 t} + \xi_8 e^{-\mu_2 t} \tag{22b}$$

in which the $\xi$'s are functions of the eigenvalues and eigenvectors of the coefficient matrix of
equation (16), and of the disturbances f(t).

The Solution Form

Using the fact that

$$u^*(t) = \frac{1}{w_3}\lambda^T b \quad \text{[cf. eq. (15b)]} \tag{23}$$

the optimal control is found to be

$$u^*(t) = \eta_1 x_1 + \eta_2 x_2 + \eta_3 e^{-\mu_1 t}\int e^{\mu_1 t}f_2(t)\,dt + \eta_4 e^{-\mu_2 t}\int e^{\mu_2 t}f_2(t)\,dt \tag{24a}$$

where

$$\eta_1 = -\frac{m}{a}\left(\frac{k}{m} - \mu_1\mu_2\right) \tag{24b}$$

$$\eta_2 = -\frac{m}{a}\left(\frac{c}{m} + \mu_1 + \mu_2\right) \tag{24c}$$

$$\eta_3 = \frac{m}{a}\,\frac{\mu_1^2 + \dfrac{c}{m}\mu_1 + \dfrac{k}{m}}{\mu_1 - \mu_2} \tag{24d}$$

$$\eta_4 = -\frac{m}{a}\,\frac{\mu_2^2 + \dfrac{c}{m}\mu_2 + \dfrac{k}{m}}{\mu_1 - \mu_2} \tag{24e}$$

(It should be noted that the feedback gains $\eta_1$ and $\eta_2$ are those which would result from
applying standard LQR theory to the homogeneous system equation $\dot{\mathbf{x}} = A\mathbf{x} + bu$.) In
equations (24a) to (f), $\mu_1$ and $\mu_2$ are the eigenvalues with negative real parts (see eqs. (18a)
and (b)), and

$$f_2(t) = \frac{k}{m}d + \frac{c}{m}\dot{d} \tag{24f}$$

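By repeated application of the method of integration by parts, the control may be re-expressed
in terms of an infinite sum:

$$u^*(t) = \eta_1 x_1 + \eta_2 x_2 + \eta_3\sum_{r=0}^{\infty}\frac{(-1)^r f_2^{(r)}(t)}{\mu_1^{r+1}} + \eta_4\sum_{r=0}^{\infty}\frac{(-1)^r f_2^{(r)}(t)}{\mu_2^{r+1}} \tag{25}$$

Rewriting $f_2$ in terms of d and $\dot{d}$, the control function becomes

$$\begin{aligned} u^*(t) = {} & \eta_1 x(t) + \eta_2\dot{x}(t) + \frac{k}{m}\left(\frac{\eta_3}{\mu_1} + \frac{\eta_4}{\mu_2}\right)d(t) \\ & + \sum_{r=1}^{n-1}(-1)^{r-1}\left[\frac{c}{m}\left(\frac{\eta_3}{\mu_1^{r}} + \frac{\eta_4}{\mu_2^{r}}\right) - \frac{k}{m}\left(\frac{\eta_3}{\mu_1^{r+1}} + \frac{\eta_4}{\mu_2^{r+1}}\right)\right]d^{(r)}(t) \\ & + (-1)^{n-1}\frac{c}{m}\left(\frac{\eta_3}{\mu_1^{n}} + \frac{\eta_4}{\mu_2^{n}}\right)d^{(n)}(t) + \text{higher order terms} \end{aligned} \tag{26}$$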


This may be written in more appealing form as

$$u^*(t) = c_p x(t) + c_v\dot{x}(t) + c_{d0}d(t) + c_{d1}\dot{d}(t) + \text{higher order terms} \tag{27}$$

in which the constant coefficients $c_p$, $c_v$, $c_{d0}$, and $c_{d1}$ may be defined from equations (24)
and (26). Clearly, if the infinite sums converge rapidly enough, the optimal control can be
approximated by

$$u^*(t) \approx c_p x(t) + c_v\dot{x}(t) + c_{d0}d(t) + c_{d1}\dot{d}(t) \tag{28}$$

For very low frequency disturbances the higher order terms in equation (26) are negligibly small,
and the control (eq. (28)) closely approximates the optimal. If, in fact, the second- and
higher-order derivatives of d(t) are identically zero, the approximation is exact. It can be
shown that for the critically damped closed loop system the eigenvalues are real and equal, and
the convergence is more rapid than for the overdamped system. Further, as the closed-loop
system eigenvalues become more negative the convergence speed goes up as well.
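The feedback and feedforward gains can be evaluated numerically for the paper's design values. The sketch below is illustrative only: the expressions for the gains are reconstructed from the state and costate equations under the sign conventions of equation (2), the function name `feedforward_gain` and the helper `s` are hypothetical, and the lbm-to-slug factor 32.2 is an assumption. The computed proportional feedforward gain is consistent with the numerator constants listed in table II (0.0300 without feedforward, 0.0001 with it).

```python
import numpy as np

# Design values quoted in the paper; 32.2 is an assumed lbm-to-slug conversion.
m, k, c, a = 100.0 / 32.2, 0.3, 0.0, 10.0
w1a, w1b, w3 = 258.0, 10.0, 1.0

A = np.array([[0.0, 1.0], [-k / m, -c / m]])
b = np.array([[0.0], [-a / m]])
A_aug = np.block([[A, b @ b.T / w3],
                  [np.diag([w1a, w1b]), -A.T]])

eigs = np.linalg.eigvals(A_aug)
# The stable pair is real for this weight choice (assumed here, not general).
mu1, mu2 = np.sort(eigs[eigs.real < 0].real)

eta1 = -(m / a) * (k / m - mu1 * mu2)                       # position feedback
eta2 = -(m / a) * (c / m + mu1 + mu2)                       # velocity feedback
eta3 = (m / a) * (mu1**2 + (c / m) * mu1 + k / m) / (mu1 - mu2)
eta4 = -(m / a) * (mu2**2 + (c / m) * mu2 + k / m) / (mu1 - mu2)

def s(p):
    """Helper: eta3/mu1^p + eta4/mu2^p."""
    return eta3 / mu1**p + eta4 / mu2**p

def feedforward_gain(n):
    """Coefficient of the n-th derivative of d(t) in the series form of the control."""
    if n == 0:
        return (k / m) * s(1)
    return (-1) ** (n - 1) * ((c / m) * s(n) - (k / m) * s(n + 1))

gains = [feedforward_gain(n) for n in range(5)]
print("feedback gains:", eta1, eta2)
print("feedforward gains c_d0..c_d4:", gains)
```

For these values the proportional feedforward gain nearly cancels the umbilical stiffness term k/a, and the gains on higher derivatives of d(t) are much smaller in magnitude.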


5. CONTROL EVALUATION 
Physical Realizability of the Control 

The control, equation (25), is physically realizable, if the states and sufficient derivatives 
of d(t) are accessible (or estimable by an observer), and if the higher order terms are negligible. 
It is not necessary that the eigenvalues be real, although the proof of this requires a more 
general linear algebra or state-transition-matrix approach. 

If values are assigned to the system parameters, associated controller gains can be
evaluated. Suppose that m = 100 lbm, k = 0.3 lbf/ft, c = 0 lbf-sec/ft, and a = 10 lbf/Amp.
With $w_3$ arbitrarily set at 1 and $w_{1b}$ varied, associated integer values of $w_{1a}$ can be found
below which the eigenvalues $\mu_1$ and $\mu_2$ will always be real. Such values are tabulated in
table I. Stated otherwise, the tabulated values of the weights $w_{1a}$ and $w_{1b}$ are those integer
values (for the sake of simplicity) for which the closed loop system is closest to being critically
damped without being underdamped. Corresponding controller feedback and feedforward gains (for
the first five derivatives) are also included.

The states $x(t)$ and $\dot{x}(t)$ and the derivatives $d(t)$, $\dot{d}(t)$, and $\ddot{d}(t)$ are clearly
available for an Earth-based system. However, in space, the only absolute measurements which
can be directly available are $\ddot{x}(t)$ and $\ddot{d}(t)$, from which $\dot{x}(t)$, $\dot{d}(t)$ and $x(t)$, $d(t)$ are obtainable
only by successive integration(s). Rearrangement of (28) into

$$u^*(t) = (c_p + c_{d0})x(t) + (c_v + c_{d1})\dot{x}(t) - c_{d0}[x(t) - d(t)] - c_{d1}[\dot{x}(t) - \dot{d}(t)] \tag{29}$$

or

$$u^*(t) = (c_p + c_{d0})d(t) + (c_v + c_{d1})\dot{d}(t) + c_p[x(t) - d(t)] + c_v[\dot{x}(t) - \dot{d}(t)] \tag{30}$$

obviates the need for one accelerometer, but one accelerometer plus two integrations remain
necessary for either the platform or the experiment. Since $[x(t) - d(t)]$ (or one of its integrals)
has not been weighted in the performance index J, experiment drift will be a problem that must
be corrected either by another control loop or by a change of system states. The latter could be
accomplished by incorporating an accelerometer attached to the experiment into the state
equation. Alternatively, one could append an integrator to the plant, include the current i(t) as a
third state, and optimize the control di/dt. But for the sake of simplicity (i.e., fewer states) the
former has been assumed (without development) in this paper.

The higher order terms of the control (eqs. (25) and (26)) can be neglected, for low
frequencies, if the eigenvalues $\mu_1$ and $\mu_2$ are of sufficient modulus. These eigenvalues, in turn,
are under the control of the designer, determined by the chosen weights $w_{1a}$, $w_{1b}$, and $w_3$. It is
apparent from equation (25) that $u^*(t)$ essentially reduces to two alternating power series. For
a sinusoidal disturbance of frequency $\omega$ the series form of the control converges for
$|\omega/\mu_i| < 1$ (i = 1,2). It can be shown that each alternating power series converges like

$$\sum_{r=0}^{\infty}(-1)^r\left|\frac{\omega}{\mu_i}\right|^r$$

For low frequency disturbances (i.e., small relative to system closed loop eigenvalues) a control
formed by series truncation very closely approximates the optimal.

For example, suppose that the normalized frequencies $|\omega/\mu_i|$ for sinusoidal disturbances
are less than 1/5, and that only the feedforward control terms $c_{d0}d(t)$ and $c_{d1}\dot{d}(t)$ are included
with the feedback terms. Even so, the feedforward portion of the truncated control, at any time
t, will be a current that is still within 4 percent (i.e., $(1/5)^2$) of the feedforward portion of the
actual optimal. If the normalized frequencies are below 1/10, this approximation error will be
less than 1 percent. Table I shows that the gains $c_{dj}$ of higher order derivatives $d^{(j)}(t)$ (see
equation (26) for the algebraic representations) are, in fact, quite small.
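The 4 percent and 1 percent figures can be reproduced with a short check. For a sinusoid of frequency omega, each feedforward series in equation (25) sums to a term proportional to 1/(mu + j*omega); keeping only the r = 0 and r = 1 terms leaves a relative error of |omega/mu|^2. The sketch below (function name `truncation_error` is hypothetical) evaluates that error directly.

```python
def truncation_error(freq_ratio, n_terms=2):
    """Relative error of truncating 1/(1 + z) to n_terms of its power series.

    z = j*omega/mu, so freq_ratio is |omega/mu|; the sign of mu does not
    change the magnitude of the error.
    """
    z = 1j * freq_ratio
    exact = 1.0 / (1.0 + z)
    partial = sum((-z) ** r for r in range(n_terms))
    return abs(partial - exact) / abs(exact)

for ratio in (1 / 5, 1 / 10):
    print(f"|omega/mu| = {ratio:.2f}: relative error = {truncation_error(ratio):.4f}")
```

Adding further series terms reduces the error by another factor of |omega/mu| per term, at the cost of requiring higher derivatives of d(t).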

In some circumstances there may be design constraints which prevent the designer from
selecting weights that lead to sufficiently rapid convergence. However, since convergence occurs
rapidly even for eigenvalues of relatively small modulus ($|\omega/\mu_i| < 1/3$), in a great many cases
the designer will have much latitude in choice of weights. For "low" frequency disturbances, in
these cases, a control which includes only one or two feedforward terms will be "close" to the
optimal. These frequencies will be well attenuated.

Higher frequency disturbances will also be well attenuated, provided the input-to-output
transfer function(s) are at least strictly proper in the Laplace transform variable s. This will
not be the case for the present problem if more than three feedforward gains ($c_{d0}$, $c_{d1}$, $c_{d2}$) are
included in the control. Practically, this means that only proportional and first-derivative
feedforward (equation (25) with r = 0,1 or equation (26) with n = 2) should be added to the
feedback control terms. As will be seen shortly, however, adding even the proportional
feedforward term(s) can dramatically improve the disturbance rejection over that afforded by
LQR feedback alone.



Transfer Function and Block Diagram 

Neglecting the higher order terms, the transfer function between input and output
accelerations or displacements is

$$\frac{s^2X(s)}{s^2D(s)} = \frac{X(s)}{D(s)} = \frac{\left(\dfrac{c}{a} - c_{d1}\right)s + \left(\dfrac{k}{a} - c_{d0}\right)}{\dfrac{m}{a}s^2 + \left(\dfrac{c}{a} + c_v\right)s + \left(\dfrac{k}{a} + c_p\right)} \tag{31}$$

and a block diagram of the controlled system can be drawn as in figure 2.
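The effect of the feedforward term on transmissibility can be illustrated by evaluating two of the closed loop transfer functions tabulated in table II along the imaginary axis. The coefficients below are copied from the table entries G1 (LQR feedback plus proportional feedforward) and G2 (LQR feedback alone); the function name `transmissibility` and the chosen frequencies are illustrative assumptions.

```python
import numpy as np

def transmissibility(num, den, freq_hz):
    """Magnitude of num(s)/den(s) at s = j*2*pi*f (coefficients in descending powers)."""
    s = 2j * np.pi * freq_hz
    return abs(np.polyval(num, s) / np.polyval(den, s))

den = [0.31056, 4.4675, 16.0624]
g1_num = [0.0000622, 0.0001]   # G1: feedback plus proportional feedforward
g2_num = [0.0000622, 0.0300]   # G2: feedback alone

for freq in (0.01, 0.1, 1.0, 10.0):
    t1 = transmissibility(g1_num, den, freq)
    t2 = transmissibility(g2_num, den, freq)
    print(f"{freq:6.2f} Hz  G1 = {t1:.3e}  G2 = {t2:.3e}  ratio = {t2 / t1:.1f}")
```

At the lowest frequencies the ratio is set almost entirely by the numerator constants, which is where the feedforward gain acts.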
Control Stability, Stability Robustness, and General Robustness


Since the control feedback gains are the same as those obtained by solution of the
standard Linear Quadratic Regulator (LQR) problem, the closed loop system is stable and
enjoys the stability robustness characteristics guaranteed by the LQR approach to optimality,
viz., a minimum of 60° phase margin, infinite positive gain margin, and 6 dB negative gain
margin (ref. 6). Additionally, numerical checks indicate that it enjoys substantial insensitivity,
or general robustness, to uncertainties in k, c, and m, as indicated by table II and figures 3 to
10. By comparing the Bode plots of figures 3, 5, 7, and 9 (corresponding to controls using both
LQR feedback (F/B) and proportional feedforward (F/F)) with those of figures 4, 6, 8, and 10,
respectively (corresponding to controls using LQR F/B only), one can see that adding
feedforward substantially improves disturbance rejection at low frequencies. For example, a
comparison of figure 3 with figure 4 indicates that the optimal control method described above
can lead to acceleration reductions of greater than four orders of magnitude for all frequencies.
This reduction is more than two orders of magnitude below that afforded by LQR feedback
alone at the lower frequencies, i.e., those most heavily weighted in the performance index.

The order of the reduction is eventually limited by control cost, of course, probably in
terms either of actuator-related limitations (such as heat removal or force generation
requirements) or of power limitations (especially in a space station environment). The control also
leads to displacement reductions of the same magnitude, limited in this case by actuator-stroke
or spatial limitations. Providing unit transmissibility for very low frequencies and/or
weighting $\int (x - d)\,dt$ in the performance index J would be steps toward addressing these latter
limitations.


Computational Aspects 

A significant amount of algebra was required to solve the two-state problem of this paper,
and the labor involved increases dramatically with each additional state. However, such
symbolic manipulators as MACSYMA may be used to ease the workload if a symbolic solution is
desired. Further, well-known numerical methods exist (e.g., Potter's method (ref. 7) or Laub's
method (ref. 8)) for solving the homogeneous system. These can readily provide the feedback
gains in numerical form, even for problems with many states. It might be anticipated, then,
that a numerical method also exists for finding the desired feedforward gains.

Such is the case, as will be shown in a later paper. 
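The numerical route to the feedback gains can be sketched with an eigenvector-based Riccati solver in the spirit of Potter's method. This is a minimal illustration, not the paper's implementation: the function name `lqr_gain_potter`, the standard-form Hamiltonian with the convention u = -Kx, and the lbm-to-slug factor 32.2 are assumptions. The paper's gains follow as (c_p, c_v) = -K because its control is written u = c_p x + c_v x_dot.

```python
import numpy as np

def lqr_gain_potter(A, B, Q, R):
    """Solve the algebraic Riccati equation from the stable eigenvectors of the
    Hamiltonian matrix. Returns (P, K) with the convention u = -K x."""
    n = A.shape[0]
    R_inv = np.linalg.inv(R)
    hamiltonian = np.block([[A, -B @ R_inv @ B.T],
                            [-Q, -A.T]])
    eigvals, eigvecs = np.linalg.eig(hamiltonian)
    stable = eigvecs[:, eigvals.real < 0]          # n eigenvectors, stable subspace
    X1, X2 = stable[:n, :], stable[n:, :]
    P = np.real(X2 @ np.linalg.inv(X1))
    K = R_inv @ B.T @ P
    return P, K

# Design values quoted in the paper; 32.2 is an assumed lbm-to-slug conversion.
m, k, c, a = 100.0 / 32.2, 0.3, 0.0, 10.0
A = np.array([[0.0, 1.0], [-k / m, -c / m]])
B = np.array([[0.0], [-a / m]])
Q = np.diag([258.0, 10.0])      # W_1 = diag(w_1a, w_1b)
R = np.array([[1.0]])           # w_3

P, K = lqr_gain_potter(A, B, Q, R)
c_p, c_v = -K[0]
residual = A.T @ P + P @ A - P @ B @ np.linalg.inv(R) @ B.T @ P + Q
print("c_p, c_v:", c_p, c_v)
print("k/a + c_p, c/a + c_v:", k / a + c_p, c / a + c_v)
```

For these values the sums k/a + c_p and c/a + c_v agree with the denominator coefficients 16.0624 and 4.4675 listed for G1 and G2 in table II, to the precision printed there.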


6. CONCLUSIONS 

This paper has applied an existing method for obtaining an optimal control to the
microgravity platform isolation problem, for which the disturbances to be rejected are
low-frequency accelerations. The system was assumed to be representable in the form
$\dot{\mathbf{x}} = A\mathbf{x} + bu + f$, with quadratic cost function
$J = \frac{1}{2}\int_0^\infty (\mathbf{x}^T W_1 \mathbf{x} + w_3 u^2)\,dt$ and diagonal weighting matrix $W_1$.
The resultant control law was found to be simple, stable, robust, and physically realizable.
Further, it was shown to have excellent acceleration- and displacement-attenuation
characteristics, and to be frequency-weighted toward the low end of the acceleration spectrum.
The method is extendable to the case for which only relative positions and velocities, and
absolute accelerations, are available; and can be applied so as to weight relative displacements in
the performance index.

The approach as presented is algebraically intensive, but symbolic manipulators can be 
used to ease the algebraic labors. Further, since the method produces feedback gains identical to 
those obtained by the LQR approach to optimality, numerical computation of those gains is 
easily accomplished, even for large systems. The feedforward gains can be found numerically 
with comparable ease. 
TABLE II. - CLOSED LOOP TRANSFER FUNCTIONS FOR SYSTEM WITH
DESIGN PARAMETER VALUES

[k = 0.3, c = 0.00622, and m = 100; G1, G3, G5, and G7 include both LQR F/B and
proportional F/F; G2, G4, G6, and G8 include LQR F/B alone. Weighting parameters used
were $w_{1a}$ = 258, $w_{1b}$ = 10, $w_3$ = 1 (table I).]

Closed loop transfer function, $s^2X(s)/s^2D(s)$:

G1(s) = (0.0000622s + 0.0001) / (0.31056s^2 + 4.4675s + 16.0624)
G2(s) = (0.0000622s + 0.0300) / (0.31056s^2 + 4.4675s + 16.0624)
G3(s) = (0.0000622s + 0.0151) / (0.31056s^2 + 4.4675s + 16.0774)
G4(s) = (0.0000622s + 0.0450) / (0.31056s^2 + 4.4675s + 16.0774)
G5(s) = (0.000622s + 0.0001) / (0.31056s^2 + 4.4680s + 16.0624)
G6(s) = (0.000622s + 0.0300) / (0.31056s^2 + 4.4680s + 16.0624)
G7(s) = (0.000622s + 0.0151) / (0.27950s^2 + 4.4680s + 16.0774)
G8(s) = (0.000622s + 0.0450) / (0.27950s^2 + 4.4680s + 16.0774)





FIGURE 2. - BLOCK DIAGRAM.
FIGURE 8. - OPTIMAL CONTROL WITH FEEDBACK ONLY, SYSTEM DAMPING ESTIMATE POOR.